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DETAILED ACTION 

Response to Arguments 

1 . Applicant's arguments filed 08/13/07 have been fully considered but they are not 
persuasive. 

Applicant argues that Shinoda et al., do not teach selecting a state having the 
Gaussian distribution number whose description length is minimum for every state 
(Amendment, page 5 - 7). 

The examiner disagrees, Shinoda et al., disclose computing the description 
length of each node in the Gaussian distribution tree structure; and selecting a node set 
which makes the description length minimum (page 4, lines 2 - A). Computing a 
description length for each node and selecting the one that with minimum description 
length imply selecting a state having the Gaussian distribution number whose 
description length is minimum for every state. 

2. Applicant argues that Shinoda et al., do not teach time correlation of data (HMM) 
used for MDL calculation and speech data for learning (Amendment, page 7). 

The examiner disagrees, Shinoda et al., disclose that when a set of models 
(1,...,i,..,,l) and data Xn = (x1,...,xn) are given, a description length I i ( x N ) applying 
model i is defined by 

I i ( x N ) = - log P A ( i ) ( x N ) + ip 2 log N + log I, whereG(i) represents the 
maximum likelihood estimate of the parameter 
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9(i)= theta.. sub. 1. sup. (i),.theta.. sub.. beta. i. sup. (i) (page 2, section 3, lines 1 -7). 
Determining maximum likelihood for the parameter of model i implies using correlation 
of data (HMM) used for MDL calculation, since every xn is assigned beforehand to one 
Gaussian component in the HMM using the Viterbi algorithm. 

3. Applicant argues that Shinoda et al., do not teach syllable HMMs (Amendment, 
page 7). 

The examiner disagrees, Shinoda et al., disclose phonological features in HMM 
(page 5, lines 3 - 7). Phonological features in HMM implies syllable HMMs. 

4. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(a) the invention was known or used by others in this country, or patented or described in a printed 
publication in this or a foreign country, before the invention thereof by the applicant for a patent. 

5. Claims 1- 9 are rejected under 35 U.S.C. 102(a) as being anticipated by Shinoda 
et al., (HMM Size Reduction using MDL Criterion, Japan, March 2002). 

As per claim 1, Shinoda et al., teach an acoustic model creating method of 
creating an HMM (Hidden Markov Model) by optimizing, for each state, Gaussian 
distribution numbers of the respective states constituting the HMM and retraining the 
optimized HMM using training speech data, the method comprising: 

setting plural types of the Gaussian distribution numbers from a predetermined 
value to a maximum distribution number for each of the plurality of states constituting 
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the HMM ("model having a large number of distributions trained with sufficient training 
data"; page 1 ; introduction, lines 1 9 - 23); 

computing a description length for each of the plurality of states having the plural 
types of Gaussian distribution numbers using a Minimum Description Length criterion 
("Minimum Description Length criterion minimum is selected for each state"; page 1; 
introduction, lines 23, and 24); 

selecting a state having the Gaussian distribution number whose description 
length is minimum, for every state (page 4, line 4); 

constructing the HMM in accordance with the state having the Gaussian 
distribution number whose description length is minimum, selected for every state ("re- 
estimate all HMM parameters"), and retraining the constructed HMM using the training 
speech data ("re-training HMM"; page 4, lines 5-7; page 5, lines 8-10); and 

performing speech recognition using the retrained HMM (page 4, section 5, lines 
1, and 2) 

As per claim 2, Shinoda et al., further disclose that for the Minimum Description 
Length criterion, a description length li(xN) using a model i when a model set {1, . . . , i, . 
. . , I} and data xN={x1 , . . . , xN} (N being a data length) are given is expressed as the 
following general equation, 

I i ( x N ) = - log P A ( i ) ( x N ) + ip 2 log N + log I . 

0(i): parameter of model i . 
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9(i)=maximum likelihood estimate of .theta..sub.1 .sup. (i),.theta.. sub. .beta. i. sup. (i) 

p (i): dimension (degree of freedom) of model i 
and in the general equation that computes the description length, the model set {1 , ... , 
i, .... I} is considered as a set of states in which plural types of the Gaussian 
distribution numbers from a predetermined value to the maximum distribution number 
are set for a predetermined state in a predetermined HMM, where, when the number of 
types of the Gaussian distribution numbers is I (I is an integer satisfying l>2), then 1, . . . 
, i, .... I are symbols that specify the respective distribution number types from a first 
type to an l-th type, and the general equation is used as an equation for computing the 

description length of the state having an i-th type of distribution number out of 1 i f . 

. I (page 2, section 3 - page 3, line 11). 

As per claims 3, and 4, Shinoda et al. , further disclose that the second term on 
the right side of the equation being multiplied by a weighting coefficient alpha, and the 
third term on the right side being omitted ("the second term is multiplied by penalty 
coefficient a"; page 3, lines 10, and 11). 

As per claim 5, Shinoda et al., further disclose that xN being a set of the 

respective training speech data obtained by matching in time series (x1 ,xN) a 

plurality of the training speech data with the respective states of the HMMs for every 
state, using the HMMs in which the respective states have any one of the Gaussian 
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distribution numbers from the predetermined value to the maximum distribution number 
("large number of distributions"; page 2, section 3, lines 1, and 2; page 1, lines 19 - 23). 

As per claim 6, Shinoda et al., further disclose that the Gaussian distribution 
numbers being the maximum distribution number ("large number of distributions"; page 
1, lines 19-23). 

As per claim 7, Shinoda et al., further disclose that the HMMs being syllable 
HMMs ("phonological"; page 5, lines 3 - 7). 

As per claim 8, Shinoda et al., further disclose that the syllable HMMs having the 
same consonant out of the states constituting the syllable HMMs tie an initial state or at 
least two states including an initial state in the syllable HMMs, and the syllable HMMs 
having the same vowel tie a final state of the states having self loops or at least two 
states including the final state in the syllable HMMs ("initial model"; page 4, section 4, 
line 5 - page 4, line 7) 

As per claim 9, Shinoda et al., further disclose that a speech recognition device 
recognizes input speech using HMMs (Hidden Markov Models) as acoustic models for 
feature data obtained by feature analysis of the input speech, the HMMs created by the 
acoustic model creating method according to claim 1 being used as the HMMs which 
are the acoustic models ("used acoustic feature"; page 4, section 5, lines 1 - 6). 
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Conclusion 

6. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1 .1 36(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 

7. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Leonard Saint-Cyr whose telephone number is (571) 

272- 4247. The examiner can normally be reached on Mon- Friday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (571) 272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is (571)- 

273- 8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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